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Abstract — Mean value and cross-covariance function of chaotic 
binary sequences are evaluated for chaotic maps with specific 
properties. We also take into account the effect of fixed- and 
floating-point representations on statistical properties of chaotic 
generators. Thus, one is able to obtain possible candidates for 
pseudo-random binary sequences generation. Results of statistical 
tests applied to chaotic binary sequences are presented. The 
chaotic binary sequences thus obtained could be used for security 
improvement in IEEE 802.11 standard. 

Index Terms — Chaos, Cryptography, Random number genera- 
tion, Binary sequences 



I. Introduction 

Pseudo-random binary sequences have apphcations in se- 
veral domains including information theory (informational 
sources), spread-spectrum systems (spreading sequences) and 
cryptography (stream cyphers, key generation, initialization 
vectors, etc.). 

Chaotic generators were already studied for cryptographic 
applications HI, 12, iS due to their specific properties: 
sensitivity to initial conditions, random-like behavior, non- 
linear dynamics, etc. As already noticed in the main 
difficulty when using chaotic generators in cryptography is 
the fact that chaotic generators take values from a continuous, 
infinite space, while classical encryption techniques operate on 
a discrete, finite space. However, since most chaotic generators 
are implemented using a finite precision (fixed- or floating- 
point), the conversion from the continuous space to the discrete 
space is realized implicitly 13|. Further conversion techniques 
allow to pass from the discrete space to a binary space thus 
obtaining chaotic binary sequences 14j. 

In order to obtain pseudo-random binary sequences, pre- 
viously proposed approaches use a bit extraction technique 
so that one or several bits are extracted from each chaotic 
value. However, there are few studies related to the selection 
of chaotic maps with good statistical properties and how the 
bit extraction technique must be designed |3|, |4|. These are 
the main topics aborded in our paper. The originality of our 
approach resides on the fact that we take into account the 
effect of fixed- and floating-point representations on statistical 
properties of chaotic generators. 

The rest of the paper is organized as follows. In section 
mi the theoretical background used in our developments is 
presented. The definitions for the mean value and cross- 
covariance function of chaotic binary sequences are given. 
Subsections III- Al and III-BI give simplified expressions of mean 
value and cross-covariance function, respectively, for chaotic 
maps with specific properties. Thus one is able to obtain 



possible candidates for pseudo-random binary sequences ge- 
neration. Results of statistical tests applied to chaotic binary 
sequences are presented in section HU] Finally some conclu- 
sions are given in section HVl 

II. Mean value and cross-covariance function of 

CHAOTIC binary SEQUENCES 

We consider chaotic sequences {xn} generated by a fun- 
ction (chaotic map) r : [0, 1] [0, 1]: 



'(Xn-l) 



(1) 



where n e {1,2,...}. Each chaotic value, a;„, and the 
parameters of the chaotic map are represented using fixed- 
or floating-point representation. 

Any chaotic map, defined on some interval [a,b], f : 
[a,b] [a,b], can be redefined on the interval [0,1], using 
the tranformation: 

/(a + (b — a)x) — a 



t{x) 



(2) 



The choice of the [0, 1] interval allows the simplification of 
our mathematical developments. 

Starting from a given value x G [0, 1] one can obtain the 
bits of the fractionary part using a sum of threshold functions 
il: 



2'-l 



b,{x)=J2i-iy-'a^{x) 



(3) 



i E {1,2,.. .}. When a fixed-point representation is used for 
X, the threshold function at (x) has the expression: 



a-t{x) 




(4) 



and, when a floating-point representation is used, the threshold 
function is: 

, , jo ,^<t 



(5) 



where e is the non-biased exponent of the floating-point value 
X 0. 

The binary sequence obtained using i e {1,2, . . .} 

is called a chaotic binary sequence if x is obtained from a 
chaotic generator ([T]i. In the following we are interested to 
determine under which conditions the chaotic binary sequence 
is pseudo-random. 

For this purpose, the following statistical measures of the 
chaotic binary sequence are defined H: 
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1) mean value 



< bi >= 



bi{x)f* (x)dx 



(6) 



2) cross-covariance function 



<~p'-'^{l-b,.bj)>= / {b,{x)-<b,>) 
Jo 

{b,ir'{x))~<b,>) r{x)da 



(7) 



where f*{x) is the density function of the chaotic map r ||6l 
p. 8] and t^x) represents the output of the chaotic map after 
/ iterations ([TJ: t''{x) = t{t{- ■ ■ t(.t))). 

Note that, according to the above definitions, the density 
function is used as a probabihty density function. Thus, (|6]l 
and (|2l) are the classical definitions of the mean and cross- 
covariance function of continuous random variables. In our 
case, the realization of the random variable is the chaotic value 
Xn, but we use a binary function bi{x) ^ in order to extract 
some binary information. 

It is worth to emphasize that, in order to integrate in (|6]l 
and (|7]i, the value x must be continuous, that is represented 
with infinite precision. This can be accomplished when the 
number of bits of the fractionary part in fixed- or floating- 
point representations is infinite. Thus, we generalize the fixed- 
and floating-point representations in order to represent any 
number in the interval [0, 1]. This approach allows to simplify 
the mathematical developments and can be seen as an ideal 
case of chaotic binary sequences obtained from chaotic values 
represented with finite precision. 

Using ^ one is able to evaluate the mean of the binary 
sequence obtained from the i-th bit of each chaotic value 
generated with ([T]). In order to ensure the pseudo-randomness 
of the chaotic binary sequence, a necessary condition is to 
have an (almost) equal number of zeros and ones Q, that is 
the mean value must be (|6]i: 



< bi >= 0.5 



(8) 



for any i in a given ensemble. Thus one is able to find which 
bits must be extracted from the fractionary part of the chaotic 
value in order to obtain a balanced number of zeros and ones. 

Further, the cross-covariance function (|2| can be used to 
evaluate the correlations between the i-th and j-th bits belon- 
ging to different chaotic values obtained from ([T]i, separated 
by I iterations. Since a non-zero cross-covariance function will 
imply the statistical dependence of bits, we are interested to 
determine when the cross-covariance function is (close to) zero 



(9) 



So, conditions (O and (|9]l can be used to select from all 
available chaotic maps potential candidates for pseudo-random 
binary sequences generation. 

In the next two subsections analytical expressions for the 
mean value and cross-covariance function will be deduced for 
chaotic maps with specific properties. 



A. Mean value 

Starting from the definition of the mean value ^ and using 
^ the mean value can be rewritten as: 



2'-l 



<6, ^(_i)'-V( ) 



where 



Pr{t) 



(Tt{x)f*{x)dx 



(10) 



(11) 



If the threshold function at {x) is defined under fixed-point 
representation (|4]i, then Pr(i) ( fTTT i can be written as: 



Pr{t) 



rix)dx 



(12) 



and if (Jt{x) is defined under floating-point representation, 
then: 

^J*{x)dx 

r{x)dx 

iin + '= 

,i„+fc f*{x)dx 



Jr. 



Prit) = < 



t2=mi„-t 

E 

fc=i 



(13) 



.S/c=l It 



t2= 



,i > 0.5 



where emin < is the minimum value of the exponent e (for 
example, in floating-point double precision e^nin = —1022 
|8|). In order to obtain ( fTsT l the interval [0, 1] is divided into 
sub-intervals defined by a constant exponent e Q and the 
definition of the threshold function crt(a;) (|5j is used. 

Note that the expression of the mean value ( fTOb depends 
only of the density function, f*{x), and the representation 
used for the chaotic map, at{x). Thus a whole class of chaotic 
maps having the same density function can be characterized 
using dTol l. 

To further simplify the expression of the mean value (fTOl i the 
density function f*{x) must be specified. We have considered 
two examples of density functions: 



and 



nx) = 1 
1 



(14) 
(15) 



TT^y — x) 
corresponding to Bernoulli map 

t{x) = mod(px, 1) (16) 
where p is the parameter of Bernoulli map and logistic map 

t{x) = 4a;(l - x) (17) 

respectively ID. 

Thus, it can be shown that when the density function is 
given by (fT4l) . under fixed-point representation, the mean value 
is: 

<5i>=0.5 (18) 
i G {1,2,.. .} and under floating-point representation: 

= l 

,i e {2,3,...} 



< b,, >= 




(19) 
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In order to prove ( fTSl ) and ( fT9] l see appendix. 

The result given by ( fT9] l can be explained by the fact that, 
when a floating-point representation is used, the first bit i — 
1 is the hidden bit of the fractionary part (significand) Q. 
Since the probability to have this bit set (normalized numbers) 
is much greater than the probability to have this bit unset 
(denormalized numbers) we obtain: < 61 1. 

Following ( fTSl l and ( fT9l ) one can conclude that, when the 
density function is given by ( fT4l ). all bits of the fractionary 
part (except the hidden bit) can be used to obtain balanced 
chaotic binary sequences. 

When the density function is given by (fTSb . under fixed- 
point representation, the mean value is: 



2'-l 



0.5 - i ^(-l)'-iasir 



and under floating-point representation: 
'0.5- iasin (2'=«»"+i - 1) 



1 ^ 

-^(-l)'--iasin (r2^" 



-j+2 



r=2>-i + l 



where 



p^{t) = - 2° (asin (2'='"'"+^+i - 1) 
fe=i 

- asin (i2'='"'"+'''+i - l)) Vi > 0.5 
In order to prove ( |20l l and (ISTT l see appendix. 



(20) 



,1 > 1 



(21) 



(22) 



Numerical evaluation of relations (I20l l and (l2Tl l shows that, 
when the density function is given by ( fTsT l. under fixed- 
point representation, all bits of the fractionary part can be 
used to obtain balanced chaotic binary sequences (Fig. [Tal l. 
However, under floating-point representation, the mean value 
only converges to 0.5 as the bit index i (l2Tl i increases (Fig.fTbli. 
So, in this case, the first bits of the fractionary part must be 
discarded and thus a less efficient bit extraction technique must 
be used in order to obtain balanced chaotic binary sequences. 

So, using the mean value of the chaotic binary sequence, one 
is able to determine the bits of the fractionary part needed to 
obtain chaotic binary sequences with an equal number of zeros 
and ones. In order to gain further insights into the statistical 
properties of chaotic binary sequences, higher order statistics 
must be used. In the following subsection the cross-covariance 
function is evaluated in order to characterize the correlations 
at the bit level in chaotic binary sequences. 

B. Cross-covariance function 

In order to simplify the expression of the cross-covariance 
function (|7]i, the chaotic map t{x) is chosen so that Q: 

\glix)\ri9,ix)) = -^rix) VI < * < TV. (23) 

where gi{x) = t^^{x), g[{x) is the first derivative of gi{x), 
Ti{x) is defined on a subinterval [di^i,di] C [0,1] so that 



Ti{x) is invertible, Ti{x) = t{x) Vx G and A^,- 

is the number of subintervals in [0, 1] on which t{x) is 
invertible. Among the chaotic maps verifying ( |23] ) one can 
mention Bernoulli map, tent map, logistic map and Chebyshev 
polynomials |4|. 

Using ( I23] ) the cross-covariance function can be written as 
|4|: 



2'-l 2^-1 



< p(^)(0;6„6,) > = E E ("1)''^' [Pr i^^ax 



2« 21 



(24) 



2* -1 2^-1 



nLiSign(.'(.-^(J))) (25) 
(p.(max(r'(J),^)) 



yi > 0, where Pr(i) has the expression given by (fTTT l. 

The cross-covariance function, (l24l i and dZSl l. depends not 
only of the density function, /* (x), and the representation used 
for the chaotic map, at{x), but also of the chaotic map itself, 
t{x), and the sign of its derivative, sign (r' (a;)). Note also 
that the cross-covariance function ( l25T l is inverse proportional 
with the number of subintervals on which t{x) is invertible, 
Nr- Thus, in order to have a cross-covariance function with 
values near to zero, the chaotic map must be chosen so that 
Nt is large enough. For example, the Bernoulli map ( fT6] l with 
p = 1.99 has Nr = 2, but when p = 3.99 we have Nr = 4. 

Numerical evaluation of ( l25T l for Bernoulli map under 
floating-point representation is given in Fig |2a] and |2b] It can 
be seen that, when the number of iterations / is fixed, the 
correlations between the bits of different chaotic values can 
be reduced by choosing the parameter of the Bernoulli map 
so that Nr is large enough. 

However, the expression of the cross-covariance function is 
too complex to provide insights into statistical properties of 
chaotic maps without full specification of the chaotic map. 
Since our goal is to find pseudo-random generators, we have 
found more useful to direclty apply statistical tests Q on 
chaotic binary sequences. The results of the application of 
statistical tests are given in the following section. 

III. Simulation results 

In our simulations we use the chaotic maps defined by (fTSl ) 
and ( fTTI l, the tent map 

t{x) = p{l - \1 - 2x\) (26) 

where p = 0.99 and the Chebyshev polynomial of third order 

Tp{x) = i2x - l)(2Tp_i(a;) - 1) + 1 - Tp^2{x) (27) 
where ti{x) = x, to{x) = 1 and p = 3. 
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0.1 - 



10 12 14 16 18 20 
bit index 



(a) 




(b) 



Fig. 1. Mean value corresponding to tlie density function given by il5i under [(a)] fixed-point representation |(b)| floating-point representation 





(a) 



(b) 



Fig. 2. Cross-covariance function for I = 5 corresponding to the Bernoulli map )16) under floating-point representation with|(a)|p = 1.99|(b)|p = 3.99 



Since under a finite representation (fixed- or floating-point) 
the chaotic sequences are periodic, we use in our simula- 
tions initial conditions generating chaotic sequences with the 
longest period |9|. The parameters used in our simulations 
are synthesized in table |I1 The fixed-point representation on 
32 bits is defined by one bit for the sign, 3 bits for the 
integer part and 28 bits for the fractionary part. The floating- 
point representation on 32 bits is the IEEE754 floating-point 
single-precision representation fSJ. We use also the floating- 
point double-precision representation with the same initial 
conditions as for single-precision (this choice is due to the 
fact that in double-precision finding the longest period takes 
a much longer time than for single-precision). From each 
chaotic value, represented with finite precision, the bits of 
the fractionary part (except the hidden bit when floating-point 
representation is used) are extracted and thus chaotic binary 
sequences are obtained. 

Using chaotic sequences represented with floating-point 
double precision, we have numerically computed the proba- 



chaotic map 


representation 


initial condition 


period len. 


Bernoulli map 
(p = 3.99) 


fixed (32 bits) 
float (32 bits) 


0.58268645778298 . . . 
0.24101102352142 . . . 


3050 
3827 


tent map 


fixed (32 bits) 
float (32 bits) 


0.0453341640532 . . . 
0.4008057713508 . . . 


583 
4311 


logistic map 


fixed (32 bits) 
float (32 bits) 


0.0009547546505 . . . 
0.99998587369918 . . . 


13404 
930 


Chebyshev pol. 
of third order 


fixed (32 bits) 
float (32 bits) 


0.99994068592786 . . . 
0.05464851856231 . . . 


10014 
1821 



TABLE I 
SIMULATION PARAMETERS 



bility density function for each chaotic map. Thus, we have 
found that Bernoulli map and tent map belong to the same 
class of chaotic maps having the density function given by 
(fl4l i. Logistic map and Chebyshev polynomial of third order 
have a density function close to ( fTSl ). So, the results obtained 
for the mean value using ( fT4l i and (fTsT i are appliable for tent 
map and Chebyshev polynomial of third order, respectively. 
In Fig. [Ill] we have computed the number of zeros and 
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Fig. 3. Number of zeros and ones of chaotic binary sequences obtained from several chaotic maps 



ones of chaotic binary sequences with 10^ bits. Thus, we 
have found that logistic map and Chebyshev polynomial of 
third order under single-precision floating-point representation 
generate chaotic binary sequences with a greater number of 
ones than zeros. This result is in agreement with the theoretical 
result obtained for the mean value (|2TI ) (Fig. [Tbb. Using the 
same representation, Bernoulli map and tent map have an 
almost equal number of zeros and ones (T9[ . When the 
fixed-point representation is used, all considered chaotic maps 
generate balanced chaotic binary sequences. The differences 
with respect to theoretical results, ( fTSl ), ( fT9] l and ( |20] |. are due 
to the fact that, in simulation, we use a finite precision, while 
in theory we have supposed that the number of bits of the 
fractionary part is infinite. However, if the number of bits of 
the fractionary part is large enough, we find a good agreement 
between theory and simulation. 

We have also applied all 16 statistical tests described in 
Q on chaotic binary sequences generated with the above 
described chaotic maps. We have used 100 binary sequences 
of 10^ bits each. The best results were obtained with Bernoulli 
map (Fig. Ha]) and tent map (Fig. l4b] ) under floating-point 
double precision representation. 

Thus, the theoretical results given in subsections III-AI and 
III-BI have allowed to select chaotic maps suitable for pseudo- 
random sequences generation. Further, one can select the 
bits to be extracted from each chaotic value for pseudo- 
random sequences generation. Bernoulli map and tent map 
were selected since chaotic maps with density function given 
by (fT4l) generate balanced chaotic binary sequences using 
all bits of the fractionary part. Note that the parameter of 



Bernoulli map p ~ 3.99 was chosen based on the fact that the 
cross-covariance function has smaller values when Nr = 4. 

By simulation, we have noticed that the period length of 
chaotic sequences has an impact on the results of statistical 
tests. The longer the period is, the better are the results 
of statistical tests. This observation could provide another 
explanation why the best results were obtained using double- 
precision floating-point representation (Fig. I4alandl4bli. 

Instead of selecting initial conditions generating chaotic 
sequences with the longest period, one can use arbitrary 
initial conditions and the perturbations technique |10|. Our 
simulations have shown that, with respect to our method, 
the perturbations technique brings little improvements of the 
results of statistical tests. However, the perturbations technique 
represents a practical approach for the implementation of 
chaotic pseudo-random generators e.g. in the Wired Equivalent 
Privacy (WEP) protocol of IEEE 802.11 standard fTTl|. 

The WEP protocol is particulary interesting for the ap- 
plication of chaotic generators since the initialization vector 
(initial condition) is often changed. Thus, short length periods 
or fixed points, specific for chaotic maps represented with 
finite precision, can be avoided. We expect that security 
improvements in IEEE 802.11 standard could be achieved 
by replacing the RC4 pseudo-random generator with chaotic 
binary generators. 

IV. Conclusion 

The mean value and cross-covariance function of chaotic 
binary sequences were analytically computed. The considered 
chaotic sequences were represented using fixed- and floating- 
point representations. We have shown that chaotic binary 
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sequences can produce binary sequences with an equal number 
of zeros and ones if the density function is f*{x) ~ 1. 
Further, these chaotic binary sequences were obtained using 
all the bits of the fractionary part of the chaotic value. This 
bit extraction technique is highly efficient with respect to 
previously proposed methods. Improvements of correlation 
properties of chaotic binary sequences are possible by increas- 
ing the number of subintervals, Nr, on which the chaotic map 
is invertible. Using these results, possible candidates (chaotic 
maps) for pseudo-random binary sequences generation can be 
selected. We have also evaluated, by means of statistical tests, 
chaotic binary sequences thus obtained. 

Further research needs to be conducted for chaotic maps un- 
der fixed-point representation. We have considered a relatively 
short fixed-point representation on 32 bits, but we expect that 
increasing the precision in this case could provide also good 
pseudo-random binary sequences. Another interesting research 
direction is to find out how much precision is needed in 
order to obtain chaotic binary sequences with good statistical 
properties. Also, it should be interesting to develop statistical 
measures by taking into account the finite representation of 
chaotic values, that is by replacing in the definitions of the 
mean value and cross-covariance function the integral by a 
sum. 

A possible application of chaotic binary sequences could be 
the WEP protocol of IEEE 802.11 standard by replacing the 
RC4 pseudo-random generator with chaotic binary generators. 
However, further cryptanalysis research should be done in 
order to prove the validity of this approach. 



Appendix 



i. Mean value for 

Fixed-point representation 
Knowing that ( fT2] l: 



we get ( [Tol l: 



With 



<5, >=^(-ir-i(i-^) 



^(-irv = (-ir-i 



and ^ we get ( fTsT l. 

Floating-point representation 
Knowing that ( fTsT l: 



Prit) 



we get ([Tol l: 

if i = 1 



if i > 1 



2em.„ (1 _ 2t) + 1 - 2'=-" ,t<0.5 
2(l-t)(l-2'=""") ,i>0.5 



2'" 



<6,> = 5](-ir-V(^) 

+ x:' (-i)^-v(j) 

r=2'-i + l 

_ 1 
~ 2 

So we have completely proved (fT9] l. 
ii. Mean value for f*{x) 

The following result is used: 



7T^ X{l — x) 



- dx — — asin (2x — 1) 

Tr^yx{l — x) TT 



Fixed-point representation 
With ^ and (O we get: 



(29) 



(30) 



(31) 

(32) 

(33) 

(34) 
(35) 



(36) 



Prit) 



1 -t 



(28) 



Pr{t) = 0.5 asin(2t - 1) 



(37) 
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So (QOli: 

<h >=0.5-- V(-l)'-^ 



2'-l 



asm 



- 1 



(38) 



Floating-point representation 
with ( |36] l and ( fT3] l we get: 



'0.5- iasin(t2'=™+2 - 1) ,i<0.5 



Pr(i) = < 



So (Unii: 

If i = 1 



, -emi„ 



fc=i 



(asin (2 



- 1) (39) 
' ,t>0.5 



-asin (i2 



emin + fc+l 



1)) 



< >=Pr(^) = 0.5 - -asin(2^"""+i - 1) (40) 

2 TT 



If i > 1 



2'" 



2'-l 



<6,:> = 5:(-irv(:^)+ E (-irv(:^) 



r=2»-i + l 



(41) 



2'" 



= (^2'=-"-*+2 - 1) (42) 

(43) 



2'-l 



+ 5: (-i)'^-v(^) 

r=2*-i + l 

So we have completely proved ( 1211 1. 
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